CL2365USNA.ST25 
SEQUENCE LISTING 

<110> E.I. du Pont de Nemours and Co., Inc. 
Cheng, Qiong 
Tao, Luan 
Sedkova, Natalia 

<120> GENES ENCODING CAROTENOID COMPOUNDS 

<130> CL2365 US NA 

<150> US 60/477,874 
<151> 2003-06-12 

<150> US 60/527,083 
<151> 2003-12-03 

<160> 20 

<170> PatentIn version 3.2 

<210> 1 

<211> 906 

<212> DNA 

<213> Pantoea agglomerans strain DC404 



<400> 1 
atgacaagac cctttgaaac acatcccggt cacgacgggg aactgcatga gctgcacgct 60
gccctgcaac gtcgcctgga tgaactgctg cccgttggcg atgagcggga tcgggtcagc 120
agcgcaatgc gcgaaggcgt actggcaccg gggaaacgca ttcgcccgct gctcctgatc 180
ctcgccgccc gcgacctcgg ctgcgatcgc gaccaccccg gcctgctgga tatggcctgt 240
gcggtggaaa tggtgcacgc ctcgtcgctg atcctcgacg atattccctg catggataac 300
gcggcgctcc ggcgcggtcg ccctaccatt catcgccagt atggtgaaga cgtggcaatt 360
ctcgctgcgg tagcgttgct cagcagcgcc tttggcgtga tggtcgcggc gcagggattg 420
tctcccgagt gccgcagcca ggcggtggcg gagctgtcga tggcggtcgg tacccagggt 480
ctggtgcagg gtcagtataa ggatctgcgt gaaggcaccg ccccgcgcag cgccgaggag 540
atcgccacca ccaacgaact gaaaaccagc gtgctgtttg gtgccacgct gcaaatcgcg 600
gccctggcgg caggcgcctc gccggcggcg cgccagaaaa tgcgctgctt tgcgcaggat 660
ttaggccagg cgttccagct gctggacgat ctggcggacg gccatgccgg gaccggcaaa 720
gacatcaata aggacgcggg taagtccacg ctggtggcga tgctcggcag cgacgcggtg 780
cgcgagcggc tcgacaccca tctgcgccgc gcagacgccc atttttcacg cgcctgcgga 840
aaaaaccagg ccacgcgacg ctttatgcac gcctggtttt caaaacagct ggccgcgttt 900
agctga 906



<210> 2 
<211> 301 
<212> PRT 

<213> Pantoea agglomerans strain DC404 
<400> 2 

Met Thr Arg Pro Phe Glu Thr His Pro Gly His Asp Gly Glu Leu His 
1 5 10 15 

Glu Leu His Ala Ala Leu Gln Arg Arg Leu Asp Glu Leu Leu Pro Val 
20 25 30 

Gly Asp Glu Arg Asp Arg Val Ser Ser Ala Met Arg Glu Gly Val Leu 
35 40 45 

Ala Pro Gly Lys Arg Ile Arg Pro Leu Leu Leu Ile Leu Ala Ala Arg 
50 55 60 

Asp Leu Gly Cys Asp Arg Asp His Pro Gly Leu Leu Asp Met Ala Cys 
65 70 75 80 

Ala Val Glu Met Val His Ala Ser Ser Leu Ile Leu Asp Asp Ile Pro 
85 90 95 

Cys Met Asp Asn Ala Ala Leu Arg Arg Gly Arg Pro Thr Ile His Arg 
100 105 110 

Gln Tyr Gly Glu Asp Val Ala Ile Leu Ala Ala Val Ala Leu Leu Ser 
115 120 125 

Ser Ala Phe Gly Val Met Val Ala Ala Gln Gly Leu Ser Pro Glu Cys 
130 135 140 

Arg Ser Gln Ala Val Ala Glu Leu Ser Met Ala Val Gly Thr Gln Gly 
145 150 155 160 

Leu Val Gln Gly Gln Tyr Lys Asp Leu Arg Glu Gly Thr Ala Pro Arg 
165 170 175 

Ser Ala Glu Glu Ile Ala Thr Thr Asn Glu Leu Lys Thr Ser Val Leu 
180 185 190 

Phe Gly Ala Thr Leu Gln Ile Ala Ala Leu Ala Ala Gly Ala Ser Pro 
195 200 205 

Ala Ala Arg Gln Lys Met Arg Cys Phe Ala Gln Asp Leu Gly Gln Ala 
210 215 220 

Phe Gln Leu Leu Asp Asp Leu Ala Asp Gly His Ala Gly Thr Gly Lys 
225 230 235 240 

Asp Ile Asn Lys Asp Ala Gly Lys Ser Thr Leu Val Ala Met Leu Gly 
245 250 255 

Ser Asp Ala Val Arg Glu Arg Leu Asp Thr His Leu Arg Arg Ala Asp 
260 265 270 

Ala His Phe Ser Arg Ala Cys Gly Lys Asn Gln Ala Thr Arg Arg Phe 
275 280 285 

Met His Ala Trp Phe Ser Lys Gln Leu Ala Ala Phe Ser 
290 295 300 

<210> 3 

<211> 1050 

<212> DNA 

<213> Pantoea agglomerans strain DC404 



<400> 3 
atgaaggacg cgcatctggt tcagcgtaaa aatgaccacc tggatatcgt gctgcaccct 60
gaccgggcga tgagtaccat tcgcaccgga tttgacgcct ggcgttttga acactgcgcc 120
ctcccggagc tggatctcga cggtatcgat ctctccacca ccctgttttc ccgcccgctg 180
aaagccccgg tgctgatcag ctccatgacc ggcggcgcgg cgcgcgccag agacattaac 240
cgtcatctgg cccaggcggc gcaaaccctt gggctggcga tgggcgtcgg ttcccagcgc 300
gtggcgctgg aggacggcgc gcagcacggg ctggatgccc agctacgcca tatcgccccg 360
gacgtgccgc tgctggctaa ccttggcgcg gcgcagatcc gcggtgcgca ggggctggac 420
tacgcccggc gcgcggtgga catgatcgac gccgacgcgt taattgtgca tctgaacccg 480
ctgcaggagg cgctccaggg cggcggcgat cgcgactggc gcggcatcct caacgccatt 540
gcgcagctgg tgcgcgacct gccggtaccg gtggtggtta aagaggtggg cgccgggatc 600
tccccggacg ttgcctgccg actggcggac gtcggcgtgg cgatgatcga cattgccggc 660
gcgggcggaa ccagctgggc ggcggtggaa gctgaacgcg ccccgacccc cgaggcgcga 720
aatgtggcga tggcctttgc cgactggggc attcctactg ccgatgcgct gcgtcgcgtc 780
catcttgcgc tgcctgatat cccgcttatc gcctccggcg gcatcgccaa cggcattgac 840
gcagcaaaag ccatcgcgct gggtgcagat ctggtgggcc aggccgcggc ggtgctggcg 900
catgccaacg cctccggcga cgcggcaatt gcccatttcc gcaccctgat tacgcagctg 960
cggatcgcct gtttctgtac cggcagtgca aacctgcagg cgttgcgaca cgccacgctg 1020
cttccggtca acggcggcgc atccctgtga 1050



<210> 4 
<211> 349 
<212> PRT 

<213> Pantoea agglomerans strain DC404 
<400> 4 

Met Lys Asp Ala His Leu Val Gln Arg Lys Asn Asp His Leu Asp Ile 
1 5 10 15 

Val Leu His Pro Asp Arg Ala Met Ser Thr Ile Arg Thr Gly Phe Asp 
20 25 30 

Ala Trp Arg Phe Glu His Cys Ala Leu Pro Glu Leu Asp Leu Asp Gly 
35 40 45 

Ile Asp Leu Ser Thr Thr Leu Phe Ser Arg Pro Leu Lys Ala Pro Val 
50 55 60 

Leu Ile Ser Ser Met Thr Gly Gly Ala Ala Arg Ala Arg Asp Ile Asn 
65 70 75 80 

Arg His Leu Ala Gln Ala Ala Gln Thr Leu Gly Leu Ala Met Gly Val 
85 90 95 

Gly Ser Gln Arg Val Ala Leu Glu Asp Gly Ala Gln His Gly Leu Asp 
100 105 110 

Ala Gln Leu Arg His Ile Ala Pro Asp Val Pro Leu Leu Ala Asn Leu 
115 120 125 

Gly Ala Ala Gln Ile Arg Gly Ala Gln Gly Leu Asp Tyr Ala Arg Arg 
130 135 140 

Ala Val Asp Met Ile Asp Ala Asp Ala Leu Ile Val His Leu Asn Pro 
145 150 155 160 

Leu Gln Glu Ala Leu Gln Gly Gly Gly Asp Arg Asp Trp Arg Gly Ile 
165 170 175 

Leu Asn Ala Ile Ala Gln Leu Val Arg Asp Leu Pro Val Pro Val Val 
180 185 190 

Val Lys Glu Val Gly Ala Gly Ile Ser Pro Asp Val Ala Cys Arg Leu 
195 200 205 

Ala Asp Val Gly Val Ala Met Ile Asp Ile Ala Gly Ala Gly Gly Thr 
210 215 220 

Ser Trp Ala Ala Val Glu Ala Glu Arg Ala Pro Thr Pro Glu Ala Arg 
225 230 235 240 

Asn Val Ala Met Ala Phe Ala Asp Trp Gly Ile Pro Thr Ala Asp Ala 
245 250 255 

Leu Arg Arg Val His Leu Ala Leu Pro Asp Ile Pro Leu Ile Ala Ser 
260 265 270 

Gly Gly Ile Ala Asn Gly Ile Asp Ala Ala Lys Ala Ile Ala Leu Gly 
275 280 285 

Ala Asp Leu Val Gly Gln Ala Ala Ala Val Leu Ala His Ala Asn Ala 
290 295 300 

Ser Gly Asp Ala Ala Ile Ala His Phe Arg Thr Leu Ile Thr Gln Leu 
305 310 315 320 

Arg Ile Ala Cys Phe Cys Thr Gly Ser Ala Asn Leu Gln Ala Leu Arg 
325 330 335 

His Ala Thr Leu Leu Pro Val Asn Gly Gly Ala Ser Leu 
340 345 

<210> 5 
<211> 1170 
<212> DNA 

<213> Pantoea agglomerans strain DC404 



<400> 5 
atgaaaaaat gggatctgat tctggtcggc gcggggctgg ccaacgggct tatcgcctgg 60
cgactaaagc agcgtcatcc gacgcttgct gtattaatgc tggagtgcgg cgacgcgccc 120
ggcggaaacc acacctggtc ctttcaccaa cacgatatca cgccagccca gcacgcctgg 180
ctggcgccgc tggtggccca tcgctgggac gggtacgacg tccactttcc gaacgtgtcg 240
cgcaccctgc atgacggcta cctgaccatc acctccacgc gttttgccca agcgatgcgc 300
gggctgatga aagagaattt gctgacaaac gtgaccgtgt cacgggtgag cgggcaggaa 360
gtaaccctca gcgacggacg acgctttacc gccggggcgg tgattgatgg ccgcggctat 420
cagccctcgc cgcacctcag cattggctat caggcgttca tcggccagga gtggcaactg 480
accgcgcccc acgggttaac gcgcccgatc ctgatggatg cccgcgtcgc ccagggcaac 540
ggctaccgct ttgtctatac cctgccgctc agcgccgaca ccctgcttat cgaagacacg 600
cactacattg acggcccgac gctcgacgcc gattcagccc gcgcgcggat tgccgattac 660
gcccgccagc agggctggca gcttgcgcgg ctggtgcgtg aggaacaggg ggcgctgccg 720
atcaccctgt ccggcgatcc ggccgccttc tggcaccagt tccatcatca gccggtcagc 780
ggcctgcgcg ccggtctgtt ccatgccacc accggctatt cgctgccgct ggcggttcgg 840
ctggcggacc gcattgccaa cgcgccggga ctgcatcagg gcgcgctcta tcagctgatc 900
gccgatttcg cggcgcgcca ctggcagaca caacgctttt tccgcctgct taaccgcatg 960
cttttcctgg ccggcacacc cgaccagcgc tggcgcgtga tgcagcggtt ttaccagctt 1020
gacgagcagc tgatcgcccg tttttatgcc ggccagcttc gctccgccga ccgcgcgcgc 1080
ctgctgcttg gcaaaccgcc ggtgccgatt gtcggggcga tcaaagccct gctccacact 1140
cattcttctc tgcgagccca tcataaatga 1170



<210> 6 
<211> 389 
<212> PRT 

<213> Pantoea agglomerans strain DC404 
<400> 6 

Met Lys Lys Trp Asp Leu Ile Leu Val Gly Ala Gly Leu Ala Asn Gly 
1 5 10 15 

Leu Ile Ala Trp Arg Leu Lys Gln Arg His Pro Thr Leu Ala Val Leu 
20 25 30 

Met Leu Glu Cys Gly Asp Ala Pro Gly Gly Asn His Thr Trp Ser Phe 
35 40 45 

His Gln His Asp Ile Thr Pro Ala Gln His Ala Trp Leu Ala Pro Leu 
50 55 60 

Val Ala His Arg Trp Asp Gly Tyr Asp Val His Phe Pro Asn Val Ser 
65 70 75 80 

Arg Thr Leu His Asp Gly Tyr Leu Thr Ile Thr Ser Thr Arg Phe Ala 
85 90 95 

Gln Ala Met Arg Gly Leu Met Lys Glu Asn Leu Leu Thr Asn Val Thr 
100 105 110 

Val Ser Arg Val Ser Gly Gln Glu Val Thr Leu Ser Asp Gly Arg Arg 
115 120 125 

Phe Thr Ala Gly Ala Val Ile Asp Gly Arg Gly Tyr Gln Pro Ser Pro 
130 135 140 

His Leu Ser Ile Gly Tyr Gln Ala Phe Ile Gly Gln Glu Trp Gln Leu 
145 150 155 160 

Thr Ala Pro His Gly Leu Thr Arg Pro Ile Leu Met Asp Ala Arg Val 
165 170 175 

Ala Gln Gly Asn Gly Tyr Arg Phe Val Tyr Thr Leu Pro Leu Ser Ala 
180 185 190 

Asp Thr Leu Leu Ile Glu Asp Thr His Tyr Ile Asp Gly Pro Thr Leu 
195 200 205 

Asp Ala Asp Ser Ala Arg Ala Arg Ile Ala Asp Tyr Ala Arg Gln Gln 
210 215 220 

Gly Trp Gln Leu Ala Arg Leu Val Arg Glu Glu Gln Gly Ala Leu Pro 
225 230 235 240 

Ile Thr Leu Ser Gly Asp Pro Ala Ala Phe Trp His Gln Phe His His 
245 250 255 

Gln Pro Val Ser Gly Leu Arg Ala Gly Leu Phe His Ala Thr Thr Gly 
260 265 270 

Tyr Ser Leu Pro Leu Ala Val Arg Leu Ala Asp Arg Ile Ala Asn Ala 
275 280 285 

Pro Gly Leu His Gln Gly Ala Leu Tyr Gln Leu Ile Ala Asp Phe Ala 
290 295 300 

Ala Arg His Trp Gln Thr Gln Arg Phe Phe Arg Leu Leu Asn Arg Met 
305 310 315 320 

Leu Phe Leu Ala Gly Thr Pro Asp Gln Arg Trp Arg Val Met Gln Arg 
325 330 335 

Phe Tyr Gln Leu Asp Glu Gln Leu Ile Ala Arg Phe Tyr Ala Gly Gln 
340 345 350 

Leu Arg Ser Ala Asp Arg Ala Arg Leu Leu Leu Gly Lys Pro Pro Val 
355 360 365 

Pro Ile Val Gly Ala Ile Lys Ala Leu Leu His Thr His Ser Ser Leu 
370 375 380 

Arg Ala His His Lys 
385 

<210> 7 
<211> 1479 
<212> DNA 

<213> Pantoea agglomerans strain DC404 



<400> 7 
atgaaacaaa ccattgtaat tggcgccggg ttcggcggac tggcgctggc gattcgcctc 60
caggcggcgg gcattcctac cacgctgctg gagagccgcg acaaacccgg cggccgcgcc 120
tatgtctacg aagatcgcgg ctttaccttt gatgcgggtc ccaccgtcat caccgatccc 180
tccgccattg aggagctgtt caccctcgcc ggaaaacggc tgaaggacta cgttgagctg 240
atgccggtga cgccgttcta tcgcctgtgc tgggaagacg gcaaggtttt cgactacgcc 300
aacgatcagg cggcgcttga gtcgcagatc gccgcgttta acccgaacga cgtggcgggc 360
tatcaccgct tcctcgacta ctcccgggcg gtgtttgccg aaggctatct gaagctcggc 420
gcggtgccgt ttctctcgtt tcgcgacatg ctgcgcgccg gtcctcaact ggcgcggctg 480
caggcatggc gcagcgtgta cgacaaagtg tcggcctacg tggaagacga gcacctgcgg 540
caggcatttt cgtttcactc gctgctggtg ggcggcaacc cgttctccac gtcttctatt 600
tacaccctga tccacgccct ggagcgggaa tggggcgtct ggttcccgcg cggcggcacc 660
ggtgcgctgg ttcagggcat ggtgaagctg tttcaggatc ttggcggcac cctcaccctt 720
aacgctcagg ttgagcggct ggagacggtg gacaatcagg tgaaggccgt gcatctggtt 780
aacgggcagc ggctggaggc tgcggcggtg gcctcgaacg cggacgtggt aaatacctat 840
gcccgactgc tcggccatca cccgcacggc gccgctacgg ccaaaaagct gaaacgcaag 900
cgcatgagca actcgctgtt cgtgctctat tttggcctgg atcaccatca cacccagctg 960
gcgcaccata ccgtctgctt tggcccgcgt tataaagcgc taatcgatga aattttcagc 1020
gccgacaccc tgtcggaaga tttttcgctc tatctgcatg cgccctgcgt aaccgacccg 1080
tcgctggccc cgccggggtg cggcagctac tatgtgctcg cgccggtgcc gcacctcggt 1140
aacgccccgc tcgactggag cgtggaaggg ccgcgtctgc gggatcgcat ttttgattat 1200
ctcgaagcgc gctatatgcc ggggctgcgc tcccagctgg tgacgcaccg catgttcacg 1260
ccggaagatt ttcgcgatac gctcgatgcc tggcaggggt cagcgttttc actggagccg 1320
atcctcaccc agagcgcctg gttccggccg cacaaccgcg acagcgtggt tgataacctc 1380
tacctggtcg gcgccggaac gcatcccggc gctggcgtgc cgggcgtgat cggatccgcc 1440
aaggcaacgg cccagttaat gttaaaggat ttagcgtaa 1479

<210> 8 

<211> 492 

<212> PRT 

<213> Pantoea agglomerans strain DC404 

<400> 8 

Met Lys Gln Thr Ile Val Ile Gly Ala Gly Phe Gly Gly Leu Ala Leu 
1 5 10 15 

Ala Ile Arg Leu Gln Ala Ala Gly Ile Pro Thr Thr Leu Leu Glu Ser 
20 25 30 

Arg Asp Lys Pro Gly Gly Arg Ala Tyr Val Tyr Glu Asp Arg Gly Phe 
35 40 45 

Thr Phe Asp Ala Gly Pro Thr Val Ile Thr Asp Pro Ser Ala Ile Glu 
50 55 60 

Glu Leu Phe Thr Leu Ala Gly Lys Arg Leu Lys Asp Tyr Val Glu Leu 
65 70 75 80 

Met Pro Val Thr Pro Phe Tyr Arg Leu Cys Trp Glu Asp Gly Lys Val 
85 90 95 

Phe Asp Tyr Ala Asn Asp Gln Ala Ala Leu Glu Ser Gln Ile Ala Ala 
100 105 110 

Phe Asn Pro Asn Asp Val Ala Gly Tyr His Arg Phe Leu Asp Tyr Ser 
115 120 125 

Arg Ala Val Phe Ala Glu Gly Tyr Leu Lys Leu Gly Ala Val Pro Phe 
130 135 140 

Leu Ser Phe Arg Asp Met Leu Arg Ala Gly Pro Gln Leu Ala Arg Leu 
145 150 155 160 

Gln Ala Trp Arg Ser Val Tyr Asp Lys Val Ser Ala Tyr Val Glu Asp 
165 170 175 

Glu His Leu Arg Gln Ala Phe Ser Phe His Ser Leu Leu Val Gly Gly 
180 185 190 

Asn Pro Phe Ser Thr Ser Ser Ile Tyr Thr Leu Ile His Ala Leu Glu 
195 200 205 

Arg Glu Trp Gly Val Trp Phe Pro Arg Gly Gly Thr Gly Ala Leu Val 
210 215 220 

Gln Gly Met Val Lys Leu Phe Gln Asp Leu Gly Gly Thr Leu Thr Leu 
225 230 235 240 

Asn Ala Gln Val Glu Arg Leu Glu Thr Val Asp Asn Gln Val Lys Ala 
245 250 255 

Val His Leu Val Asn Gly Gln Arg Leu Glu Ala Ala Ala Val Ala Ser 
260 265 270 

Asn Ala Asp Val Val Asn Thr Tyr Ala Arg Leu Leu Gly His His Pro 
275 280 285 

His Gly Ala Ala Thr Ala Lys Lys Leu Lys Arg Lys Arg Met Ser Asn 
290 295 300 

Ser Leu Phe Val Leu Tyr Phe Gly Leu Asp His His His Thr Gln Leu 
305 310 315 320 

Ala His His Thr Val Cys Phe Gly Pro Arg Tyr Lys Ala Leu Ile Asp 
325 330 335 

Glu Ile Phe Ser Ala Asp Thr Leu Ser Glu Asp Phe Ser Leu Tyr Leu 
340 345 350 

His Ala Pro Cys Val Thr Asp Pro Ser Leu Ala Pro Pro Gly Cys Gly 
355 360 365 

Ser Tyr Tyr Val Leu Ala Pro Val Pro His Leu Gly Asn Ala Pro Leu 
370 375 380 

Asp Trp Ser Val Glu Gly Pro Arg Leu Arg Asp Arg Ile Phe Asp Tyr 
385 390 395 400 

Leu Glu Ala Arg Tyr Met Pro Gly Leu Arg Ser Gln Leu Val Thr His 
405 410 415 

Arg Met Phe Thr Pro Glu Asp Phe Arg Asp Thr Leu Asp Ala Trp Gln 
420 425 430 

Gly Ser Ala Phe Ser Leu Glu Pro Ile Leu Thr Gln Ser Ala Trp Phe 
435 440 445 

Arg Pro His Asn Arg Asp Ser Val Val Asp Asn Leu Tyr Leu Val Gly 
450 455 460 

Ala Gly Thr His Pro Gly Ala Gly Val Pro Gly Val Ile Gly Ser Ala 
465 470 475 480 

Lys Ala Thr Ala Gln Leu Met Leu Lys Asp Leu Ala 
485 490 



<210> 9 

<211> 927 

<212> DNA 

<213> Pantoea agglomerans strain DC404 



<400> 9 
atgtcccagc cgcttctcga acacgccagc gccaccatga ccgccggttc taaaagtttc 60
gccaccgcct caaagctgtt tgacaaacgc acccggcgca gcgcgctgat gctctatacc 120
tggtgccgct actgcgacga tgttatcgac ggacaggtgg tgggttttgc tgccccgacc 180
gagcagagcg acacgcccga ggcgcgcctg caacggctgc gtaagatgac gcgccgcgcc 240
tacgacgggg aaaccatgca agagccgccg ttcgccgcct ttcaggaggt tgccctcgcc 300
catgccattc cgcctactca ggccttcgac cacctggaag gctatgcgat ggacgtgcgc 360
aacgagcgct attacagcct cgatgatacg ctccgctact gttatcacgt ggcgggcgtg 420
gtcggcctga tgatggccag ggtgatggga gtgcgggacg aagccacgct ggatcgcgcc 480
tgcgatctgg gcattgcctt tcagctcacc aatatcgcca gggatatcgt tgacgatgcg 540
caggtgggac gctgctacct gccgcagcag tggctggcgg aagtcggact caatgaacag 600
acctgcaccg tgcgggccaa ccgtccggcg ctggcgcgtc tggcagcgcg gctggtgacc 660
gaggctgagc cctattatca gtcagcgctt gccgggctgg gggatctgcc cctgcgctcc 720
gcctgggcga ttgccaccgc gcacggggtg tatcgtgaga tcggggtgaa ggtgctgatg 780
gcgggtgaaa aagcatggga tacccgccag ggcacgacgc gcgcggagaa gctggcgctg 840
gttatttccg gcgcgaagca ggcgatggct tcccggaagg cgagctggcc gccgcgcgat 900
ccgcacctct ggcagcgccc gcgctag 927



<210> 10 
<211> 308 
<212> PRT 

<213> Pantoea agglomerans strain DC404 
<400> 10 

Met Ser Gln Pro Leu Leu Glu His Ala Ser Ala Thr Met Thr Ala Gly 
1 5 10 15 

Ser Lys Ser Phe Ala Thr Ala Ser Lys Leu Phe Asp Lys Arg Thr Arg 
20 25 30 

Arg Ser Ala Leu Met Leu Tyr Thr Trp Cys Arg Tyr Cys Asp Asp Val 
35 40 45 

Ile Asp Gly Gln Val Val Gly Phe Ala Ala Pro Thr Glu Gln Ser Asp 
50 55 60 

Thr Pro Glu Ala Arg Leu Gln Arg Leu Arg Lys Met Thr Arg Arg Ala 
65 70 75 80 

Tyr Asp Gly Glu Thr Met Gln Glu Pro Pro Phe Ala Ala Phe Gln Glu 
85 90 95 

Val Ala Leu Ala His Ala Ile Pro Pro Thr Gln Ala Phe Asp His Leu 
100 105 110 

Glu Gly Tyr Ala Met Asp Val Arg Asn Glu Arg Tyr Tyr Ser Leu Asp 
115 120 125 

Asp Thr Leu Arg Tyr Cys Tyr His Val Ala Gly Val Val Gly Leu Met 
130 135 140 

Met Ala Arg Val Met Gly Val Arg Asp Glu Ala Thr Leu Asp Arg Ala 
145 150 155 160 

Cys Asp Leu Gly Ile Ala Phe Gln Leu Thr Asn Ile Ala Arg Asp Ile 
165 170 175 

Val Asp Asp Ala Gln Val Gly Arg Cys Tyr Leu Pro Gln Gln Trp Leu 
180 185 190 

Ala Glu Val Gly Leu Asn Glu Gln Thr Cys Thr Val Arg Ala Asn Arg 
195 200 205 

Pro Ala Leu Ala Arg Leu Ala Ala Arg Leu Val Thr Glu Ala Glu Pro 
210 215 220 

Tyr Tyr Gln Ser Ala Leu Ala Gly Leu Gly Asp Leu Pro Leu Arg Ser 
225 230 235 240 

Ala Trp Ala Ile Ala Thr Ala His Gly Val Tyr Arg Glu Ile Gly Val 
245 250 255 

Lys Val Leu Met Ala Gly Glu Lys Ala Trp Asp Thr Arg Gln Gly Thr 
260 265 270 

Thr Arg Ala Glu Lys Leu Ala Leu Val Ile Ser Gly Ala Lys Gln Ala 
275 280 285 

Met Ala Ser Arg Lys Ala Ser Trp Pro Pro Arg Asp Pro His Leu Trp 
290 295 300 

Gln Arg Pro Arg 
305 

<210> 11 

<211> 558 

<212> DNA 

<213> Pantoea agglomerans strain DC404 

<400> 11 



atgcttgcgt tgtggaatac cgggatcgtg ctactgacta tcatcatcat ggaaggggtg 60
gcaacgttcg cacacaagta catcatgcac ggctggggat ggggctggca tcattcgcac 120
cataccccgc gcaaaggggc gtttgagcgt aacgatctct atgcggtggt gtttgcgcta 180
ctggccattg cgctgattta cgcgggcagc gaagggtact ggccgcttca gtggattggc 240
gcgggaatga ccggctacgg cgtgatctac tttatcgttc acgatggttt agtccaccag 300
cgctggccgt tccgttacgt gccgcgccgc ggctatctgc gccgcctcta catggcacac 360
cggctgcatc acgcggtgcg ggggcgcgaa gggtgcgtct ccttcgggtt tatctacgcc 420
ccaccggtgg acaagctgca ggcggtgctg cgcgaacgta acggcagacc cgctagcgcg 480
ggcgctgcca gaggtgcgga tcgcgcggcg gccagctcgc cttccgggaa gccatcgcct 540
gcttcgcgcc ggaaataa 558



<210> 12 
<211> 185 
<212> PRT 

<213> Pantoea agglomerans strain DC404 
<400> 12 

Met Leu Ala Leu Trp Asn Thr Gly Ile Val Leu Leu Thr Ile Ile Ile 
1 5 10 15 

Met Glu Gly Val Ala Thr Phe Ala His Lys Tyr Ile Met His Gly Trp 
20 25 30 

Gly Trp Gly Trp His His Ser His His Thr Pro Arg Lys Gly Ala Phe 
35 40 45 

Glu Arg Asn Asp Leu Tyr Ala Val Val Phe Ala Leu Leu Ala Ile Ala 
50 55 60 

Leu Ile Tyr Ala Gly Ser Glu Gly Tyr Trp Pro Leu Gln Trp Ile Gly 
65 70 75 80 

Ala Gly Met Thr Gly Tyr Gly Val Ile Tyr Phe Ile Val His Asp Gly 
85 90 95 

Leu Val His Gln Arg Trp Pro Phe Arg Tyr Val Pro Arg Arg Gly Tyr 
100 105 110 

Leu Arg Arg Leu Tyr Met Ala His Arg Leu His His Ala Val Arg Gly 
115 120 125 

Arg Glu Gly Cys Val Ser Phe Gly Phe Ile Tyr Ala Pro Pro Val Asp 
130 135 140 

Lys Leu Gln Ala Val Leu Arg Glu Arg Asn Gly Arg Pro Ala Ser Ala 
145 150 155 160 

Gly Ala Ala Arg Gly Ala Asp Arg Ala Ala Ala Ser Ser Pro Ser Gly 
165 170 175 

Lys Pro Ser Pro Ala Ser Arg Arg Lys 
180 185 

<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer HK12 
<400> 13 

gagtttgatc ctggctcag 19 

<210> 14 

<211> 15 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer JCR14 

<400> 14 

acgggcggtg tgtac 15 

<210> 15 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer JCR15 

<400> 15 

gccagcagcc gcggta 16 

<210> 16 
<211> 1324 
<212> DNA 

<213> Pantoea agglomerans strain DC404 
<400> 16 

atgacgctgg cggcaggcct aacacatgca agtcgaacgg taacaggaag cagcttgctg 60
cttcgctgac gagtggcgga cgggtgagta atgtctgggg atctgcctga tggaggggga 120
taactactgg aaacggtagc taataccgca taacgtcgca agaccaaaga gggggacctt 180
cgggcctctt gccatcagat gaacccagat gggattagct agtaggtggg gtaacggctc 240
acctaggcga cgatccctag ctggtctgag aggatgacca gccacactgg aactgagaca 300
cggtccagac tcctacggga ggcagcagtg gggaatattg cacaatgggc gcaagcctga 360
tgcagccatg ccgcgtgtat gaagaaggcc ttcgggttgt aaagtacttt cagcgaggag 420
gaaggtggta aggttaataa ccttatcaat tgacgttact cgcagaagaa gcaccggcta 480
actccgtgcc agcagccgcg gtaatacgga gggtgcaagc gttaatcgga attactgggc 540
gtaaagcgca cgcaggcggt ctgtcaagtc ggatgtgaaa tccccgggct caacctggga 600
actgcattcg aaactggcag gctagagtct tgtagagggg ggtagaattc caggtgtagc 660
ggtgaaatgc gtagagatct ggaggaatac cggtggcgaa ggcggccccc tggacaaaga 720
ctgacgctca ggtgcgaaag cgtggggagc aaacaggatt agataccctg gtagtccacg 780
ccgtaaacga tgtcgacttg gaggttgttc ccttgaggag tggcttccgg agctaacgcg 840
ttaagtcgac cgcctgggga gtacggccgc aaggttaaaa ctcaaatgaa ttgacggggg 900
cccgcacaag cggtggagca tgtggtttaa ttcgatgcaa cgcgaagaac cttacctggc 960
cttgacatcc acggaattcg gcagagatgc cttagtgcct tcgggaaccg tgagacaggt 1020
gctgcatggc tgtcgtcagc tcgtgttgtg aaatgttggg ttaagtcccg caacgagcgc 1080
aacccttatc ctttgttgcc agcgattcgg tcgggaactc aaaggagact gccggtgata 1140
aaccggagga aggtggggga tgacgtcaag tcatcatggc ccttacggcc agggctacac 1200
acgtgctaca atggcgtata caaagagaag cgaactcgcg agagcaagcg gacctcataa 1260
agtacgtcgt agtccggatc ggagtctgca actcgactcc gtgaagtcgg aatcgctagt 1320
aatc 1324



<210> 17 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer TET-1 FP-1 

<400> 17 

gggtgcgcat gatcctctag agt 23 

<210> 18 
<211> 8814 
<212> DNA 

<213> Pantoea agglomerans strain DC404 
<400> 18 

accgcgaaga cagcaacgtg ctggagaccc gctttgagac ggaaccgggt cggtgctgat 60 
caccgagtcg ctgaacagca cgcttgctgg ccggctgccg tggagcgaac tggcccgccg 120 
catcgacggt attgagggcc acgtgacgct gaacgtcagc ctgcgtttcg gtaccgctgc 180 
cgagacgcgc tccccgtgga gggcgaacac ctttaagggc gatgtgtttc acattgccga 240 
tctgatggcg atgctgcgca ccagcgaaga cattgagatt acccactgcg acgatgaaaa 300 
aattaccgcc cagctgatga cctcaccggg gtcgcgctcg ctggtcgccc tgctggtcac 360 
cgagaaagag ccgctggcgg tgccggatct ctccgccatc gatgaccgca tcgaaaccag 420
tggtgcgact ggacccgcag cctcagctac cgcggtctct acgacaagca 480
cgtcaaacga tccgcgctgg cgctgaagtt tctctggtac tccccgaccg gcgcgctggc 540
ggcggcggcc accacctcgc tgccggaagg cattggcggg gagaaaaact acgactaccg 600
ctatgcctgg gtgcgcgatg cctgtctgat catcaaagcg ttcgtgttcc tcggtgcgct 660
ggaggactgc aaagccgcct tctcctggct gtcgaaaacc attattcgcc acgggcctga 720
gctgcgcgcc tgctatacgc tcgaaggtga cgaggtgccg gccgagtact atccgccgct 780
gcgcggatat cgggattccc gcccggtgcg ggtgggcaac aacgcccgca accagatcca 840
gctcagcatg tacggcgaca tgctcgccac cgcgcagctg tttatcgagg cgggacacgt 900
actggatctc gccacctcgc gcctgcttgg cgaactggcg aactgctgcg ccgacagctg 960
gcggcagaag gactccggca tctgggagtt accggacgag cagcactata cccactcgaa 1020
gatggcctgc tggctggcac tggatcgcgc cgtggcgatg gcagaacaga agcacatcga 1080
accgacctgg gtcgggcgct ggcagcgcga gcgcgatcgg atccgcgact ggatcgaaac 1140
ccactgctgg tcggagaaaa agcaggccta cgtgttttac gtcggggacg acgagcggct 1200
ggatgccgcg ctggcgctgg tgcacgacta cggcaacagc gtaaacccgc agcgtatgct 1260
ggccacctat cgcgccatca aagcggagct gggacacgac acgcccatgc tctaccgcta 1320
cagcgaggtg gaaaaggaag aaagcacctt tgtcgcctgc tcgttctggc tggtggaagc 1380
cctcgccgcg atgggtgaaa ccgacgaggc ccaggcggcc atgaccggca tcctcgagag 1440
gctctgcgac cggggcaatg ttgaaacttt taacgagatg tttgatgtgc gtaccgacga 1500
gtggcgcggc aaccttcctc aggggctgag ccatctggcg ctgatctgcg ccgcgcaggc 1560
gctttcggaa aaatgccgca acacgcgcga ctgacgcacg cgtagctaag gagaagacga 1620
tgaccatcag aggtatcgaa catattggta ttaccgtcgc cgacctcacc ggggcggagc 1680
ggtttttcat cgaggcgctg gatgccagcg tgctctaccg catcgtgccg cccggcgcgg 1740
cggacaatgc catcagcggc gaccagatga cgcggctcaa tggctttccc ccggagatgc 1800
gggttaccgg cctggccatg ctgcgtctcg gcaacggctg caatattgag ctgtttgaga 1860
tcgatcccgg cgtggcagac gcgcccggaa atatcagcca ggcgggcctg aaccacctgt 1920
cggtttacgt ggacgacatt cagcaggccg gcgcacgggt aaaagcacag ggcgccacgc 1980
tgtttgacgg gccgagcgac tgctttgctc aggaagaggg ccgcggcaac cagacctggt 2040
tctgccgcac gccttttggc ctgctgattg aactcatctc ccttccctcg ccgcttcgct 2100
acgatgcgca ggcgcagcaa acccgctgga tcccccagcg ctgacaggcc tctctcacgc 2160
gggcatcgcc cgcgttgtca taccctcgtc accgtcctga caaaaattaa caataaattt 2220


tcatttttca 


□ccaaacttt 




COt COCCata 


acatttattt 

CI V— CA L L L CI C l_ l_ 


tcatctaaar 


2280 


ctatacaaga aaaacattga tgtataactt tgcataccgc tgcacacagg ctcagactgc 2340
gacacccgtt gcgggtcagc gctatttcca tttcatctgc gagacgccgg gtaccaacca 2400
tgacaagacc ctttgaaaca catcccggtc acgacgggga actgcatgag ctgcacgctg 2460
ccctgcaacg tcgcctggat gaactgctgc ccgttggcga tgagcgggat cgggtcagca 2520
gcgcaatgcg cgaaggcgta ctggcaccgg ggaaacgcat tcgcccgctg ctcctgatcc 2580
tcgccgcccg cgacctcggc tgcgatcgcg accaccccgg cctgctggat atggcctgtg 2640
cggtggaaat ggtgcacgcc tcgtcgctga tcctcgacga tattccctgc atggataacg 2700
cggcgctccg gcgcggtcgc cctaccattc atcgccagta tggtgaagac gtggcaattc 2760
tcgctgcggt agcgttgctc agcagcgcct ttggcgtgat ggtcgcggcg cagggattgt 2820
ctcccgagtg ccgcagccag gcggtggcgg agctgtcgat ggcggtcggt acccagggtc 2880
tggtgcaggg tcagtataag gatctgcgtg aaggcaccgc cccgcgcagc gccgaggaga 2940
tcgccaccac caacgaactg aaaaccagcg tgctgtttgg tgccacgctg caaatcgcgg 3000
ccctggcggc aggcgcctcg ccggcggcgc gccagaaaat gcgctgcttt gcgcaggatt 3060
taggccaggc gttccagctg ctggacgatc tggcggacgg ccatgccggg accggcaaag 3120
acatcaataa ggacgcgggt aagtccacgc tggtggcgat gctcggcagc gacgcggtgc 3180
gcgagcggct cgacacccat ctgcgccgcg cagacgccca tttttcacgc gcctgcggaa 3240
aaaaccaggc cacgcgacgc tttatgcacg cctggttttc aaaacagctg gccgcgttta 3300
gctgagcaac ggatacaccc cggtaatatt tgtggagatc acatgaagga cgcgcatctg 3360
gttcagcgta aaaatgacca cctggatatc gtgctgcacc ctgaccgggc gatgagtacc 3420
attcgcaccg gatttgacgc ctggcgtttt gaacactgcg ccctcccgga gctggatctc 3480
gacggtatcg atctctccac caccctgttt tcccgcccgc tgaaagcccc ggtgctgatc 3540
agctccatga ccggcggcgc ggcgcgcgcc agagacatta accgtcatct ggcccaggcg 3600
gcgcaaaccc ttgggctggc gatgggcgtc ggttcccagc gcgtggcgct ggaggacggc 3660
gcgcagcacg ggctggatgc ccagctacgc catatcgccc cggacgtgcc gctgctggct 3720
aaccttggcg cggcgcagat ccgcggtgcg caggggctgg actacgcccg gcgcgcggtg 3780
gacatgatcg acgccgacgc gttaattgtg catctgaacc cgctgcagga ggcgctccag 3840
ggcggcggcg atcgcgactg gcgcggcatc ctcaacgcca ttgcgcagct ggtgcgcgac 3900
ctgccggtac cggtggtggt taaagaggtg ggcgccggga tctccccgga cgttgcctgc 3960
cgactggcgg acgtcggcgt ggcgatgatc gacattgccg gcgcgggcgg aaccagctgg 4020
gcggcggtgg aagctgaacg cgccccgacc cccgaggcgc gaaatgtggc gatggccttt 4080
gccgactggg gcattcctac tgccgatgcg ctgcgtcgcg tccatcttgc gctgcctgat 4140
atcccgctta tcgcctccgg cggcatcgcc aacggcattg acgcagcaaa agccatcgcg 4200
ctgggtgcag atctggtggg ccaggccgcg gcggtgctgg cgcatgccaa cgcctccggc 4260
gacgcggcaa ttgcccattt ccgcaccctg attacgcagc tgcggatcgc ctgtttctgt 4320
accggcagtg 


caaacctgca 


ggcgttgcga 


cacgccacgc 


tgcttccggt 


caacggcggc 


4380 


gcatccctgt 


gacgcagtac 


ggtgccttat 


accggggagc 


ggtatgaaaa 


aatgggatct 


4440 


gattctggtc 


ggcgcggggc 


tggccaacgg 


gcttatcgcc 


tggcgactaa 


agcagcgtca 


4500 
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tccgacgctt 


gctgtattaa 


tgctggagtg 


CL2365USNA 
cggcgacgcg 


.ST25 

cccggcggaa 


accacacctg 


4560 


gtcctttcac 


caacacgata 


tcacgccagc 


ccagcacgcc 


tggctggcgc 


cgctggtggc 


4620 


ccatcgctgg 


gaegggtacg 


acgtccactt 


tecgaaegtg 


tcgcgcaccc 


tgeatgaegg 


4680 


ctacctgacc 


atcacctcca 


cgcgttttgc 


ecaagegatg 


cgcgggctga 


tgaaagagaa 


4740 


tttgctgaca 


aacgtgaccg 


tgtcacgggt 


gagegggcag 


gaagtaaccc 


teagegaegg 


4800 


acgacgcttt 


accgccgggg 


cggtgattga 


tggccgcggc 


tatcagccct 


cgccgcacct 


4860 


cagcattggc 


tatcaggegt 


tcatcggcca 


ggagtggcaa 


ctgaccgcgc 


cccacgggtt 


4920 


aacgcgcccg 


atcctgatgg 


atgcccgcgt 


cgcccagggc 


aacggctacc 


gctttgtcta 


4980 


taccctgccg 


ctcagcgccg 


acaccctgct 


tatcgaagac 


acgcactaca 


ttgacggccc 


5040 


gacgctcgac 


gecgattcag 


cccgcgcgcg 


gattgecgat 


tacgcccgcc 


agcagggctg 


5100 


gcagcttgcg 


cggctggtgc 


gtgaggaaca 


gggggcgctg 


ccgatcaccc 


tgtceggega 


5160 


tccggccgcc 


ttctggcacc 


agttccatca 


tcagccggtc 


agcggcctgc 


gcgccggtct 


5220 


gttccatgcc 


accaccggct 


attcgctgcc 


gctggcggtt 


cggctggcgg


accgcattgc 


5280 


caacgcgccg 


ggactgcatc


agggcgcgct


ctatcagctg 


atcgccgatt


tcgcggcgcg 


5340 


ccactggcag 


acacaacgct 


ttttccgcct 


gcttaaccgc 


atgcttttcc


tggccggcac 


5400 


acccgaccag 


cgctggcgcg 


tgatgcagcg 


gttttaccag 


cttgacgagc 


agctgatcgc


5460 


ccgtttttat 


gccggccagc 


ttcgctccgc 


cgaccgcgcg 


cgcctgctgc 


ttggcaaacc 


5520 


gccggtgccg 


attgtcgggg 


cgatcaaagc 


cctgctccac 


actcattctt 


ctctgcgagc 


5580 


ccatcataaa 


tgaaacaaac 


cattgtaatt 


ggcgccgggt 


tcggcggact


ggcgctggcg 


5640 


attcgcctcc 


aggcggcggg


cattcctacc 


acgctgctgg


agagccgcga


caaacccggc 


5700 


ggccgcgcct 


atgtctacga 


agatcgcggc


tttacctttg 


atgcgggtcc


caccgtcatc 


5760 


accgatccct 


ccgccattga 


ggagctgttc 


accctcgccg 


gaaaacggct


gaaggactac 


5820 


gttgagctga 


tgccggtgac 


gccgttctat


cgcctgtgct 


gggaagacgg


caaggttttc 


5880 


gactacgcca 


acgatcaggc 


ggcgcttgag


tcgcagatcg


ccgcgtttaa 


cccgaacgac 


5940 


gtggcgggct 


atcaccgctt 


cctcgactac 


tcccgggcgg 


tgtttgccga 


aggctatctg 


6000 


aagctcggcg 


cggtgccgtt 


tctctcgttt 


cgcgacatgc 


tgcgcgccgg 


tcctcaactg 


6060 


gcgcggctgc 


aggcatggcg 


cagcgtgtac 


gacaaagtgt 


cggcctacgt 


ggaagacgag 


6120 


cacctgcggc 


aggcattttc 


gtttcactcg 


ctgctggtgg 


gcggcaaccc 


gttctccacg 


6180 


tcttctattt 


acaccctgat 


ccacgccctg 


gagcgggaat


ggggcgtctg 


gttcccgcgc 


6240 


ggcggcaccg 


gtgcgctggt 


tcagggcatg 


gtgaagctgt 


ttcaggatct 


tggcggcacc 


6300 


[residues 6301-6360 illegible in source]


6360


catctggtta 


acgggcagcg


gctggaggct 


gcggcggtgg 


cctcgaacgc 


ggacgtggta 


6420 


aatacctatg 


cccgactgct 


cggccatcac 


ccgcacggcg 


ccgctacggc 


caaaaagctg 


6480 


aaacgcaagc


gcatgagcaa 


ctcgctgttc 


gtgctctatt


ttggcctgga 


tcaccatcac 


6540 
acccagctgg


cgcaccatac 


cgtctgcttt 
ataaagcgct


aatcgatgaa 


6600 


attttcagcg 


ccgacaccct 


gtcggaagat 


ttttcgctct 


atctgcatgc 


gccctgcgta 


6660 


accgacccgt 


cgctggcccc 


gccggggtgc 


ggcagctact 


atgtgctcgc 


gccggtgccg 


6720 


cacctcggta 


acgccccgct 


cgactggagc 


gtggaagggc 


cgcgtctgcg 


ggatcgcatt 


6780 


tttgattatc 


tcgaagcgcg 


ctatatgccg 


gggctgcgct 


cccagctggt 


gacgcaccgc 


6840 


atgttcacgc 


cggaagattt 


tcgcgatacg 


ctcgatgcct 


ggcaggggtc 


agcgttttca 


6900 


ctggagccga 


tcctcaccca 


gagcgcctgg 


ttccggccgc 


acaaccgcga 


cagcgtggtt 


6960 


gataacctct 


acctggtcgg 


cgccggaacg 


catcccggcg 


ctggcgtgcc 


gggcgtgatc 


7020 


ggatccgcca 


aggcaacggc 


ccagttaatg 


ttaaaggatt 


tagcgtaatg 


tcccagccgc 


7080 


ttctcgaaca 


cgccagcgcc 


accatgaccg 


ccggttctaa 


aagtttcgcc 


accgcctcaa 


7140 


agctgtttga 


caaacgcacc 


cggcgcagcg 


cgctgatgct 


ctatacctgg 


tgccgctact 


7200 


gcgacgatgt 


tatcgacgga 


caggtggtgg 


gttttgctgc 


cccgaccgag 


cagagcgaca 


7260 


cgcccgaggc 


gcgcctgcaa 


cggctgcgta 


agatgacgcg 


ccgcgcctac 


gacggggaaa 


7320 


ccatgcaaga 


gccgccgttc 


gccgcctttc 


aggaggttgc 


cctcgcccat 


gccattccgc 


7380 


ctactcaggc 


cttcgaccac 


ctggaaggct 


atgcgatgga 


cgtgcgcaac 


gagcgctatt 


7440 


acagcctcga 


tgatacgctc 


cgctactgtt 


atcacgtggc 


gggcgtggtc 


ggcctgatga 


7500 


tggccagggt 


gatgggagtg 


cgggacgaag 


ccacgctgga 


tcgcgcctgc 


gatctgggca 


7560 


ttgcctttca 


gctcaccaat 


atcgccaggg 


atatcgttga 


cgatgcgcag 


gtgggacgct 


7620 


gctacctgcc 


gcagcagtgg 


ctggcggaag 


tcggactcaa 


tgaacagacc 


tgcaccgtgc 


7680 


gggccaaccg 


tccggcgctg 


gcgcgtctgg 


cagcgcggct 


ggtgaccgag 


gctgagccct 


7740 


attatcagtc 


agcgcttgcc 


gggctggggg 


atctgcccct 


gcgctccgcc 


tgggcgattg 


7800 


ccaccgcgca 


cggggtgtat 


cgtgagatcg 


gggtgaaggt 


gctgatggcg 


ggtgaaaaag 


7860 


catgggatac 


ccgccagggc 


acgacgcgcg 


cggagaagct 


ggcgctggtt 


atttccggcg 


7920 


cgaagcaggc 


gatggcttcc 


cggaaggcga 


gctggccgcc 


gcgcgatccg 


cacctctggc 


7980 


agcgcccgcg 


ctagcgggtc 


tgccgttacg 


ttcgcgcagc 


accgcctgca 


gcttgtccac 


8040 


cggtggggcg 


tagataaacc 


cgaaggagac 


gcacccttcg 


cgcccccgca 


ccgcgtgatg 


8100 


cagccggtgt 


gccatgtaga 


ggcggcgcag 


atagccgcgg 


cgcggcacgt 


aacggaacgg 


8160 


ccagcgctgg 


tggactaaac 


catcgtgaac 


gataaagtag 


atcacgccgt 


agccggtcat 


8220 


tcccgcgcca 


atccactgaa 


gcggccagta 


cccttcgctg 


cccgcgtaaa 


tcagcgcaat 


8280 


ggccagtagc 


gcaaacacca 


ccgcatagag 


atcgttacgc 


tcaaacgccc 


ctttgcgcgg 


8340 




[residues 8341-8400 illegible in source]


8400 


cgttgccacc 


ccttccatga 


tgatgatagt 


cagtagcacg 


atcccggtat 


tccacaacgc 


8460 


aagcataggt 


ttttcctgta 


gttgacagcc 


cctaaagcgt 


agcctggaat 


gccaggaaac


8520 


ataagcgtaa 


cctcggggat 


aatgcgcttt 


tcaggcgtaa 


aagcatttat 


gacaattatt 


8580 
catcgcgcca cgttcacgcc gtgacgccct gctcaccgcg cggcagcagc cgcatcggct 8640 

gataaacgcg cccggtttct gcgcgtcatc gcccggtgtg cgcggcgtca acgcaataaa 8700 

acttactttc aaaaggcggc ccgaaaaggc tacccttttt tattcttgtc atatactcga 8760 

tctaacctga attatcgccg taacgtaccg cttcttttga ggtaatcccg gagc 8814 



<210> 19 

<211> 34 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer 

<400> 19 

gaattcacta gtcgagacgc cgggtaccaa ccat 34 



<210> 20 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer 

<400> 20 

gaattctagc gcgggcgctg ccaga 25 